CDS

Accession Number TCMCG024C56329
gbkey CDS
Protein Id XP_022039342.1
Location join(168806633..168806645,168806722..168806912,168807713..168807896,168808009..168808146,168808253..168808423,168808512..168808615,168808692..168808759,168809233..168809399,168809989..168810058,168810203..168810357,168810440..168810542,168810624..168810711,168810788..168810931,168811116..168811343)
Gene LOC110941952
GeneID 110941952
Organism Helianthus annuus

Protein

Length 607aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022183650.2
Definition serine protease SPPA, chloroplastic [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category OU
Description protease
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
KEGG_ko ko:K04773        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGGTTCATCAAAGGACAACAATGGCGATCTCAAATCCAAAATGGATGTTAACGGAGGCGAAAATGAGTATCCAAGTGGCGAATTCGAGTACAAAACTCCAACTGCTTGGAAGAGCTTCATGGTAAACCTACGCATGCTAGTTGCTTACCCATGGCTCAGGGTTCGCAAGGGTAGCGTATTGTATATTAAACTGCGCGGCAAGATAACTGATCAATTAAAGAGTCGTTTCTCTTCCGGTTTATCACTACCACAATTATGTGAAAACTTGATTAAGGCAGCGTATGATCCTCATATATCTGGTGTTTATCTCCACATTGAAACCTTGAATTGTGGATGGGGTATAACTGATGAAATCAGAAGGCACATATTGGATTTTAAGAAGTCAGGGAAGTTCATTATCGGTTACGCACCTGTATGGCATGAAAAGGAGTATTACCTCGGATGTGTCTGTGACGAGTTCTACGCCCCTCCAAGTGCCTACTTTTCATTGTATGGTTTTTCTAGAGGAGCGTCGTTTTATGGAGGTGTATTTGAGAAAATAGGTGTGGAACCACAAGTGCATAGGATTGGTAAGTATAAGACTTTTGGCGATATGTTAACTCGCAAGAATATATCCGAAGAAAATCGTGAGGTGCTTACTACAATCCTGGATGATGTCTACGAGAATTGGGTCGATAAGGTTTCTCAAGCCAAAGGAAAGAGTAAGGAAGAAATCAAGAGTTTTATTAATGAAGGAGTTTACCAAATAGATAAGTTGAAGGAAGATGGATGGATAACAGATATCAAATATGATGATGAGGTTAAATCTATGTTGAAAACAAGATTATGCATTGCTGAGAAGAAAAAATTTACACTTATTGAATACAAGAAATACTCGAGAATCAGGAAATGGAGTGTGGGGTTATCAGATGGAAAAGACCGAATTGCGGTAATTAGAGCTTCTGGTAGCATTACTCGTGTAGGAGGGTCGTTTTTTACGCCTAGTTCAGGCATCGTAGCTGAACAATTCATCAAAAAGATTAGCAAAGTAAGAGATTCAAAAAGGTATAAGGCCGTTATCATCCGAATTGATAGCCCTGGGGGTGGTCATGTTGCTTCTGACCTGATGTGGAGGGAAATCAAACTATTGGCAGAATCCAAGCCTGTAATTGCATCAATGGTTGACGTGGCCGCAAGTGGAGGATACTACATGGCAATGGCGGCGAATGCTATAGTCTCCGAGAATCTTACTTTAACGGGCTCAATTGGTGTAGTCTCATTGAATTACAATTCGGAGAAACTATTTGAAAAGATTGGTTTCAACAAAGAAGTTATATCAAAGGGACGATATGCTGAGCTGTTTACCGATAACCGGTCATTCAGACCTGATGAAGAGAAACTGTTTGCGGAGCGTGCCCAGAATATTTACGAACGCTTTCGTGAAAAGGCAGCATGTTCCAGATCAATGAGTGTGGAAGAGATGGAAGAGATAGCTCAAGGGAGAGTATGGAGTGGTAAGGATGCTGCTTCACGAGGTTTAGTTGATGCAATCGGAGGCTTTTCACGGGCTGTTGCTATAGCCAAACACAAGGCCAACATACCTCACAACAAACAGGTCGCACTGGTTGAGCTTTCGAAACCATCACTATCTATACAAAAATTCCTATTTGGCATGTTGAGCTCAGCAATCGGAATAGACAAAACACTAAAGCATCTGCAGGGTGATTTTGCAACGAGCGACGAGGTGCAAGCACGCATGGATGGAGCCATGTTTCATGGGTCAGGAGGATCATCTGCGGTCCCTAATTTCGGTTTTCTAAAAGACTACGTAGCTTCTCTTTGA
Protein:  
MGSSKDNNGDLKSKMDVNGGENEYPSGEFEYKTPTAWKSFMVNLRMLVAYPWLRVRKGSVLYIKLRGKITDQLKSRFSSGLSLPQLCENLIKAAYDPHISGVYLHIETLNCGWGITDEIRRHILDFKKSGKFIIGYAPVWHEKEYYLGCVCDEFYAPPSAYFSLYGFSRGASFYGGVFEKIGVEPQVHRIGKYKTFGDMLTRKNISEENREVLTTILDDVYENWVDKVSQAKGKSKEEIKSFINEGVYQIDKLKEDGWITDIKYDDEVKSMLKTRLCIAEKKKFTLIEYKKYSRIRKWSVGLSDGKDRIAVIRASGSITRVGGSFFTPSSGIVAEQFIKKISKVRDSKRYKAVIIRIDSPGGGHVASDLMWREIKLLAESKPVIASMVDVAASGGYYMAMAANAIVSENLTLTGSIGVVSLNYNSEKLFEKIGFNKEVISKGRYAELFTDNRSFRPDEEKLFAERAQNIYERFREKAACSRSMSVEEMEEIAQGRVWSGKDAASRGLVDAIGGFSRAVAIAKHKANIPHNKQVALVELSKPSLSIQKFLFGMLSSAIGIDKTLKHLQGDFATSDEVQARMDGAMFHGSGGSSAVPNFGFLKDYVASL